On The Classification Of Indic Languages
نویسنده
چکیده
Language, as part of human expression, may be viewed in analogy with genetic expression. Evolution of language is a result of complex temporal and spatial processes where, if one could aggregate the processes, one may speak in terms of parent traits and the resultant descendent traits. Insights from the theory of non-linear dynamics indicate that the multitude of interactions amongst speakers would lead to the formation of just a few languages. Strongly interacting systems of very many components, like assemblies of neurons or human speakers, have only a few stable interaction states, called attractors, associated with their behaviour,1and these, for speakers, are the various languages. In evolving systems, the nature of these stable states will also change. This is how isolated languages can be seen to change. But more significant than this process is the change due to interaction with other languages. With this background it is clear that a correct view of language evolution is within the framework of other interacting languages. But for about one and a half centuries, language evolution has been studied using models inspired by early, mechanistic physics. Like a physical system that evolves due to radiation and other incident forces, languages were taken to change spontaneously. The spread of languages was explained by another mechanistic metaphor, namely, that of transfer of populations and invasions. This led to models of language families. The German philologist August Schleicher pioneered the tree approach in the 1860’s which assumes that when populations are isolated their speech get increasingly differentiated until they become distinct languages; this assumption allows one to set
منابع مشابه
A Comprehensive Analysis of Stemmers Available for Indic Languages
Stemming is the process of term conflation. It conflates all the word variants to a common form called as stem. It plays significant role in numerous Natural Language Processing (NLP) applications like morphological analysis, parsing, document summarization, text classification, part-of-speech tagging, question-answering system, machine translation, word sense disambiguation, information retrie...
متن کاملLexical Semantics and Selection of TAM in Bantu Languages: A Case of Semantic Classification of Kiswahili Verbs
The existing literature on Bantu verbal semantics demonstrated that inherent semantic content of verbs pairs directly with the selection of tense, aspect and modality formatives in Bantu languages like Chasu, Lucazi, Lusamia, and Shiyeyi. Thus, the gist of this paper is the articulation of semantic classification of verbs in Kiswahili based on the selection of TAM types. This is because the sem...
متن کاملIndica, an Indic preprocessor for TEX A Sinhalese TEX System
In this paper a two-fold project is described: the first part is a generalized preprocessor for Indic scripts (scripts of languages currently spoken in India—except Urdu—, Sanskrit and Tibetan), with several kinds of input (LTEX commands, 7-bit ascii, CSX, ISO/IEC 10646/unicode) and TEX output. This utility is written in standard Flex (the gnu version of Lex), and hence can be painlessly compil...
متن کاملA Text Input Scheme for Indic Languages with Large Numbers of Print- able Characters
This paper discusses design and development of a text-input scheme for phonetic Brahmic languages with a large number of printable characters. We devise an input scheme for an exemplar Indic language with the understanding that the findings are generalizable to other Indic languages. Our results show that a casual user is able to type at a reasonable speed with our approach.
متن کاملThe Festvox Indic Frontend for Grapheme-to-Phoneme Conversion
Text-to-Speech (TTS) systems convert text into phonetic pronunciations which are then processed by Acoustic Models. TTS frontends typically include text processing, lexical lookup and Grapheme-to-Phoneme (g2p) conversion stages. This paper describes the design and implementation of the Indic frontend, which provides explicit support for many major Indian languages, along with a unified framewor...
متن کاملAnalysis of Phonetic Matching Approaches for Indic Languages
Phonetic matching plays an important role in multilingual information retrieval, where data is manipulated in multiple languages. User needs information in their local language which may be different from the language where data has been maintained. In such an environment, we need a system which matches the strings phonetically irrespective of errors either exactly or approximately. There are m...
متن کامل